
    Efficient Task-Local I/O Operations of Massively Parallel Applications

    Applications on current large-scale HPC systems use enormous numbers of processing elements for their computation and have access to large amounts of main memory for their data. Nevertheless, they still need file-system access to maintain program and application data persistently. Characteristic I/O patterns that put a high load on the file system often occur during access to checkpoint and restart files, which have to be stored frequently so that the application can be restarted after program termination or system failure. On large-scale HPC systems with distributed memory, each application task often performs such I/O individually by creating task-local file objects on the file system. At large scale, these I/O patterns impose substantial stress on the metadata-management components of the I/O subsystem. For example, the simultaneous creation of thousands of task-local files in the same directory can cause delays of several minutes. Similar metadata contention occurs at the startup of dynamically linked applications, when the dynamic loader searches for library files, inducing a comparably high metadata load on the file system. In such load scenarios, even mid-scale applications suffer startup delays of ten minutes or more. Dynamic linking and loading are therefore rarely used on large HPC systems today, although dynamic linking has many advantages for managing large code bases. The reason for these limitations is that POSIX I/O and the dynamic loader are implemented as serial components of the operating system and do not take advantage of the parallel nature of the I/O operations. To avoid the above bottlenecks, this work describes two novel approaches for integrating locality awareness (e.g., through aggregation or caching) into the serial I/O operations of parallel applications. The underlying methods are implemented in two tools, SIONlib and Spindle, which exploit knowledge of application parallelism to coordinate access to file-system objects. In addition, the applied methods use knowledge of the underlying I/O-subsystem structure, the parallel file-system configuration, and the network between the HPC system and the I/O system to optimize application I/O. Both tools add layers between the parallel application and the POSIX-based standard interfaces of the operating system for I/O and dynamic loading, eliminating the need to modify the underlying system software. SIONlib is already used in several applications, including PEPC, muphi, and MP2C, to implement efficient checkpointing, and it is integrated into the performance-analysis tools Scalasca and Score-P to store and read trace data efficiently. Recent benchmarks on the Blue Gene/Q in Jülich demonstrate that SIONlib solves the metadata problem at large scale, running efficiently with up to 1.8 million tasks while maintaining high I/O bandwidths of 60-80% of the file-system peak with negligible file-creation time. The scalability of Spindle was demonstrated by running the Pynamic benchmark, a proxy for a real application, at large scale on a Lawrence Livermore National Laboratory cluster. The results show that the startup of dynamically linked applications is now feasible on more than 15000 tasks, while Spindle's overhead remains nearly constant and low. With SIONlib and Spindle, this work demonstrates how the scalability of operating-system components can be improved without modifying them and without changing the I/O patterns of applications. In this way, SIONlib and Spindle represent prototype implementations of functionality needed by next-generation runtime systems.
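
    For illustration, the following minimal sketch shows task-local checkpointing through SIONlib's parallel C interface. The call names (sion_paropen_mpi, sion_fwrite, sion_parclose_mpi) follow SIONlib's documented API, but the file name, chunk size, number of physical files, and omitted error handling are illustrative assumptions, and exact signatures may differ between SIONlib versions.

```c
/* Minimal sketch: each MPI task writes its checkpoint into one logical
 * task-local file that SIONlib aggregates into a few physical files.
 * Based on SIONlib's C API; parameters here are illustrative assumptions. */
#include <mpi.h>
#include <stdio.h>
#include <sion.h>

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);
    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    int numFiles = 1;                 /* physical files on disk (assumption) */
    MPI_Comm lComm = MPI_COMM_WORLD;  /* communicator for task-to-file mapping */
    sion_int64 chunksize = 1 << 20;   /* per-task chunk size: 1 MiB (assumption) */
    sion_int32 fsblksize = -1;        /* let SIONlib query the file-system block size */
    FILE *fp = NULL;
    char *newfname = NULL;

    /* One collective open replaces thousands of individual file creations. */
    int sid = sion_paropen_mpi("checkpoint.sion", "bw", &numFiles,
                               MPI_COMM_WORLD, &lComm, &chunksize,
                               &fsblksize, &rank, &fp, &newfname);

    double state[1024] = {0};         /* hypothetical per-task checkpoint data */
    sion_fwrite(state, sizeof(double), 1024, sid);

    sion_parclose_mpi(sid);
    MPI_Finalize();
    return 0;
}
```

    The essential point is that a single collective open replaces per-task file creation, which is what removes the metadata bottleneck described above while each task keeps its familiar task-local write pattern.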

    Extreme-scaling Applications 24/7 on JUQUEEN Blue Gene/Q

    Jülich Supercomputing Centre has offered Extreme Scaling Workshops since 2009, with the latest edition in February 2015 giving seven international code teams an opportunity to (im)prove the scaling of their applications to all 458752 cores of the JUQUEEN IBM Blue Gene/Q. Each team successfully adapted its application codes and datasets to the restricted compute-node memory and exploited the massive parallelism with up to 1.8 million processes or threads. They thereby qualified for membership in the High-Q Club, which now has over 24 codes demonstrating extreme scalability. Achievements in both strong and weak scaling are compared and complemented with a review of programming languages and parallelisation paradigms, exploitation of hardware threads, and file I/O requirements.

    Technogenic deposits formed at mining-industry sites in Khakassia

    Several technogenic objects at mining enterprises in Khakassia are characterized; they are a source of environmental impact and are promising candidates for study with regard to recovering additional products.

    The calibration and evaluation of speed-dependent automatic zooming interfaces.

    Speed-Dependent Automatic Zooming (SDAZ) is an exciting new navigation technique that couples the user's rate of motion through an information space with the zoom level. The faster a user scrolls in the document, the 'higher' they fly above the work surface. At present, there are few guidelines for the calibration of SDAZ. Previous work by Igarashi & Hinckley (2000) and Cockburn & Savage (2003) fails to give values for the predefined constants governing their automatic zooming behaviour. The absence of formal guidelines means that SDAZ implementers are forced to adjust the properties of the automatic zooming by trial and error. This thesis aids calibration by identifying the low-level components of SDAZ. Base calibration settings for these components are then established using a formal evaluation recording participants' comfortable scrolling rates at different magnification levels. To ease our experiments with SDAZ calibration, we implemented a new system that provides a comprehensive graphical user interface for customising SDAZ behaviour. The system was designed to simplify future extensions: for example, new components such as interaction techniques and methods to render information can easily be added with little modification to existing code. This system was used to configure three SDAZ interfaces: a text-document browser, a flat map browser and a multi-scale globe browser. The three calibrated SDAZ interfaces were evaluated against three equivalent interfaces with rate-based scrolling and manual zooming. The evaluation showed that SDAZ is 10% faster for acquiring targets in a map than rate-based scrolling with manual zooming, and 4% faster for acquiring targets in a text document. Participants also preferred automatic zooming over manual zooming. No difference was found for the globe browser in acquisition time or preference. However, for all interfaces participants commented that automatic zooming was less physically and mentally draining than manual zooming.
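
    To make the speed-to-zoom coupling concrete, here is a minimal sketch of the core SDAZ mapping. The linear form, the threshold, and all constants are illustrative assumptions, not the calibrated values this thesis derives.

```c
/* Minimal SDAZ sketch: map scroll speed to a magnification level.
 * Below a speed threshold the view stays at full magnification; above it
 * the view zooms out linearly with speed, clamped to a minimum.
 * All constants are illustrative assumptions, not calibrated values. */
#include <stdio.h>

static double sdaz_magnification(double speed_px_per_s)
{
    const double threshold = 200.0;   /* speed at which zooming starts (assumed) */
    const double slope     = 0.001;   /* magnification lost per px/s (assumed) */
    const double min_mag   = 0.1;     /* never zoom out past 10% (assumed) */

    if (speed_px_per_s <= threshold)
        return 1.0;                   /* slow scrolling: full magnification */

    double mag = 1.0 - slope * (speed_px_per_s - threshold);
    return (mag < min_mag) ? min_mag : mag;
}

int main(void)
{
    /* Faster scrolling yields a smaller magnification ('higher' viewpoint). */
    for (double v = 0.0; v <= 1200.0; v += 300.0)
        printf("speed %6.0f px/s -> magnification %.2f\n",
               v, sdaz_magnification(v));
    return 0;
}
```

    Calibration in this framing amounts to choosing the threshold, slope, and clamp so that the zoom level tracks the comfortable scrolling rates measured in the evaluation.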

    VISIT - a Visualization Interface Toolkit, Version 1.0

    With the increasing capabilities of both supercomputers and graphical workstations, new modes of operation become feasible for numerical simulations that are traditionally performed in batch processing. Connecting a workstation to a compute server allows interactive monitoring (online visualization) and control (computational steering, interactive simulation) of such simulations. Typical issues are extracting data and status information from a running simulation, dynamically changing parameters, dynamically attaching the visualization to and detaching it from the simulation, and recording and replaying simulation results. VISIT is a library that supports the development of interactive simulations. It provides functions for establishing a connection between a simulation and a visualization, exchanging data, and eventually shutting down the connection again. VISIT is developed at the Central Institute for Applied Mathematics at the Research Centre Juelich. VISIT uses a simple client-server approach: no central server or data manager is involved, and data is exchanged directly between a simulation (the client) and a visualization (the server). The only third party that comes into play is a directory server that is used for exchanging contact information. VISIT provides support for the AVS/Express and Perl/Tk visualization systems and offers C, Fortran, and Perl language bindings.
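
    The connection life cycle described above can be pictured with a short sketch from the simulation (client) side. The visit_* names below are hypothetical stand-ins for the pattern, not VISIT's actual C binding, and the bodies are mock stubs so the sketch compiles and runs.

```c
/* Hypothetical sketch of the VISIT client-server life cycle from the
 * simulation (client) side: register with a directory server, exchange
 * data each timestep, then shut the connection down.  The visit_* names
 * are illustrative stand-ins, NOT VISIT's real API; bodies are mocks. */
#include <stdio.h>
#include <stddef.h>

typedef struct { int connected; } visit_conn;    /* mock connection handle */

/* Contact information is exchanged via a directory server; data then
 * flows directly between simulation and visualization (mocked here). */
static visit_conn visit_connect(const char *dirserver, const char *service)
{
    printf("registering '%s' at directory server %s\n", service, dirserver);
    visit_conn c = { 1 };
    return c;
}

static void visit_send_data(visit_conn *c, const double *field, size_t n)
{
    if (c->connected)
        printf("sending %zu values to the visualization\n", n);
    (void)field;
}

static void visit_disconnect(visit_conn *c)
{
    printf("detaching visualization from simulation\n");
    c->connected = 0;
}

int main(void)
{
    double field[64] = {0};           /* hypothetical simulation state */
    visit_conn c = visit_connect("dirserver.example.org", "my_simulation");

    for (int step = 0; step < 3; ++step) {
        /* ... advance the simulation state in 'field' ... */
        visit_send_data(&c, field, 64);          /* online monitoring */
    }
    visit_disconnect(&c);                        /* shut the connection down */
    return 0;
}
```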

    JUQUEEN - Best practices
